An Automated Evaluation Metric for Chinese Text Entry

Authors

  • Mike Tian-Jian Jiang
  • James Zhan
  • Jaimie Lin
  • Jerry Lin
  • Wen-Lien Hsu

Abstract

In this paper, we propose an automated evaluation metric for text entry. We also consider possible improvements to existing text entry evaluation metrics, such as the minimum string distance error rate, keystrokes per character, cost per correction, and a unified approach proposed by MacKenzie, so that they can accommodate the special characteristics of Chinese text. Current methods do not account for typing speed and accuracy together when evaluating Chinese text entry. Our goal is to remove the bias that arises from human factors. First, we propose a new metric, called the correction penalty (P), based on Fitts' law and Hick's law. Next, we transform it into the approximate amortized cost (AAC) of information theory. We also present an analysis of the AAC of Chinese text input methods with different context lengths.
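
As an illustration of two of the existing metrics named in the abstract, the following minimal sketch computes the minimum string distance (MSD) error rate and keystrokes per character (KSPC) using their commonly cited definitions from the text entry literature: MSD error rate = MSD(presented, transcribed) / max(|presented|, |transcribed|) x 100%, and KSPC = keystrokes / transcribed characters. The function names and the sample strings are assumptions made for this example; they are not taken from the paper, and the sketch does not implement the correction penalty or the AAC, which the abstract does not define.

```python
def minimum_string_distance(a: str, b: str) -> int:
    """Levenshtein distance: fewest insertions, deletions, and
    substitutions needed to turn string a into string b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def msd_error_rate(presented: str, transcribed: str) -> float:
    """MSD error rate in percent (assumed definition, see lead-in)."""
    longest = max(len(presented), len(transcribed))
    if longest == 0:
        return 0.0
    return 100.0 * minimum_string_distance(presented, transcribed) / longest


def keystrokes_per_character(keystroke_count: int, transcribed: str) -> float:
    """KSPC: keystrokes in the input stream per transcribed character."""
    if not transcribed:
        raise ValueError("transcribed text must not be empty")
    return keystroke_count / len(transcribed)


if __name__ == "__main__":
    # Hypothetical sample: one of three presented characters is missing.
    print(msd_error_rate("你好嗎", "你好"))
    # Hypothetical sample: 12 keystrokes produced 3 characters.
    print(keystrokes_per_character(12, "你好嗎"))
```

Because Python strings are sequences of Unicode code points, each Chinese character counts as one unit in both functions, whereas a single character typically requires several keystrokes; this is one reason KSPC values for Chinese input methods are not directly comparable with those for alphabetic text.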


Similar resources

Mobile Phone Keypad Design for Fast Chinese Text Entry by Phonetic Spelling

The trend of using text messaging on mobile phones has grown rapidly in the last decade. However, mapping the alphabet to twelve phone keys introduces challenging ambiguities for text entry. This challenge is exacerbated in Chinese by the large phonetic alphabet and homophonic Chinese characters. In response, we propose a novel algorithm to generate keypad layouts that reduce ambiguity for Chin...

A New Psychometric-inspired Evaluation Metric for Chinese Word Segmentation

Word segmentation is a fundamental task for Chinese language processing. However, with successive improvements, the standard metric is becoming unable to distinguish between state-of-the-art word segmentation systems. In this paper, we propose a new psychometric-inspired evaluation metric for Chinese word segmentation, which addresses to balance the very skewed word distribution at different levels o...

SCESS: A WFSA-based Automated Simplified Chinese Essay Scoring System with Incremental Latent Semantic Analysis

Writing in language tests is regarded as an important indicator for assessing language skills of test takers. As Chinese language tests become popular, scoring a large number of essays becomes a heavy and expensive task for the organizers of these tests. In the past several years, some efforts have been made to develop automated simplified Chinese essay scoring systems, reducing both costs and ...

Stanford University’s Chinese-to-English Statistical Machine Translation System for the 2008 NIST Evaluation

This document describes Stanford University’s first entry into a NIST MT evaluation. Our entry to the 2008 evaluation mainly focused on establishing a competent baseline with a phrase-based system similar to (Och and Ney, 2004; Koehn et al., 2007). In a three-week effort prior to the evaluation, our attention focused on scaling up our system to exploit nearly all Chinese-English parallel data p...

Overview of the IWSLT 2005 evaluation campaign

This paper reports an overview of the evaluation campaign results of the IWSLT 2005 workshop. The BTEC corpus, which consists of typical travel domain phrases, was used. Data for the five language pairs Arabic/Chinese/Japanese/Korean to English and English to Chinese was prepared. To study how much the amount of the training data and how much different training and decoding approaches contri...

Journal title:
  • CoRR

Volume abs/0704.3662

Publication date 2007